Similarity Reasoning and Filtration for Image-Text Matching

نویسندگان

چکیده

Image-text matching plays a critical role in bridging the vision and language, great progress has been made by exploiting global alignment between image sentence, or local alignments regions words. However, how to make most of these infer more accurate scores is still underexplored. In this paper, we propose novel Similarity Graph Reasoning Attention Filtration (SGRAF) network for image-text matching. Specifically, vector-based similarity representations are firstly learned characterize comprehensive manner, then (SGR) module relying on one graph convolutional neural introduced relation-aware similarities with both alignments. The (SAF) further developed integrate effectively selectively attending significant representative meanwhile casting aside interferences non-meaningful We demonstrate superiority proposed method achieving state-of-the-art performances Flickr30K MSCOCO datasets, good interpretability SGR SAF extensive qualitative experiments analyses.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation of Similarity Measures for Template Matching

Image matching is a critical process in various photogrammetry, computer vision and remote sensing applications such as image registration, 3D model reconstruction, change detection, image fusion, pattern recognition, autonomous navigation, and digital elevation model (DEM) generation and orientation. The primary goal of the image matching process is to establish the correspondence between two ...

متن کامل

Local Similarity Measures for Multimodal Image Matching

In this paper we focus on local similarity measures based on Shannon entropy which can be used for multimodal image matching employing deformations. The advantage of our approach is that global similarity or similarity of a larger image region can be computed from the similarities of its constitutive parts or individual voxels. We also discuss the interpolation artefacts in entropy based simila...

متن کامل

Nonlinear Similarity Based Image Matching

Image matching is an inarguably important operation for many practical sophisticated systems in machine vision and medical diagnosis. Many gray-level image matching applications use the sum-of-squared-difference (SSD) or sum-ofabsolute-differences (SAD), which are very sensitive to noise. Almost all images have some kind of noise, which causes the matching tasks significantly difficulty. In thi...

متن کامل

Stacked Cross Attention for Image-Text Matching

In this paper, we study the problem of image-text matching. Inferring the latent semantic alignment between objects or other salient stuffs (e.g. snow, sky, lawn) and the corresponding words in sentences allows to capture fine-grained interplay between vision and language, and makes image-text matching more interpretable. Prior works either simply aggregate the similarity of all possible pairs ...

متن کامل

Text Matching as Image Recognition

Matching two texts is a fundamental problem in many natural language processing tasks. An effective way is to extract meaningful matching patterns from words, phrases, and sentences to produce the matching score. Inspired by the success of convolutional neural network in image recognition, where neurons can capture many complicated patterns based on the extracted elementary visual patterns such...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2021

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v35i2.16209